Strength of the purifying selection against different categories of the point mutations in the coding regions of the human genome.
نویسندگان
چکیده
Using available Information on the total absolute size of the coding region of the human genome, data on codon usage and pseudogene-derived mutation rates for different single nucleotide substitutions we have estimated, for the human genome, the potential numbers of mutation events capable to produce: (1) nonsense; (2) missense (radical and conservative); (3) silent; (4) splice; and (5) protein-elongating (those changing wild-type stop codon into an amino acid encoding codon) mutations. We used the NCBI dbSNP database to retrieve data on the observed number of polymorphisms of each category. The fraction of polymorphisms in each category among all potential events in the genome depends on the strength of selection: the higher the rate of polymorphism, the weaker the selection. We used nonsense mutations as a referent group. Compared with nonsense mutations, we found that the relative selection coefficient against protein-elongating mutations was 21%, and the relative selection was 12% against missense mutations. Radical missense mutations were found to be four times more deleterious compared to conservative ones. Surprisingly, we found that silent mutations on average are not neutral; with the average harmfulness of 3% of nonsense mutations. Silent mutations may be deleterious when they affect splicing by creating cryptic donor-acceptor sites or by disturbing exonic splicing enhancers (ESESs). The average selection coefficient against splice mutations was 48% of that against nonsense mutations. Converting the relative selection coefficients into absolute ones using data on loss-of-function mutations in Saccharomyces cerevisiae and Caenorhabditis elegans, or by analysis of the expected frequency of mutations in the human genome, suggested that genetic drift could play a role in population dynamics of conservative missense and silent mutations.
منابع مشابه
Phylogenetic Analysis of Three Long Non-coding RNA Genes: AK082072, AK043754 and AK082467
Now, it is clear that protein is just one of the most functional products produced by the eukaryotic genome. Indeed, a major part of the human genome is transcribed to non-coding sequences than to the coding sequence of the protein. In this study, we selected three long non-coding RNAs namely AK082072, AK043754 and AK082467 which show brain expression and local region conservation among vertebr...
متن کاملکاوش ژنومی برای ردیابی نشانههای انتخاب در اسب نژاد ترکمن
Abstract Selection not only increases the frequency of new-useful mutations but also remains some signals throughout the genome. Since these areas are often control economically important traits, identifying and tracking these areas is the most important issue in the animal genetics. The aim of this study was to detecting signals of selection in the genome of Turkmen horse using 70K SNP chip...
متن کاملDetection of Pre-treatment mutations leading to resistance to direct hepatitis C virus blocking drugs in patients with chronic hepatitis C
Background and objective: Human is the only host of hepatitis C virus. This virus has a positive single stranded RNA and lipoprotein envelop that has 7 confirmed genotypes. According to studies, genotypes 1a, 3a and 1b are the most common genotypes in Iran. No effective vaccine against HCV infection has been developed instead, advances in antiviral treatment using drugs that directly affect spe...
متن کاملLong non-coding RNAs and their significance in human diseases
Protein-coding genes account for only a small fraction of the human genome and most of the genomic sequences are transcriptionally silent, but recent observations indicate significant functional elements, including non-coding protein transcripts in the human genome. Long non-coding RNAs (lncRNAs) have been defined as transcripts of >200 nucleotides without protein-coding capacity that perform t...
متن کاملInvestigating single nucleotide polymorphism (SNP) density in the human genome and its implications for molecular evolution.
We investigated the single nucleotide polymorphism (SNP) density across the human genome and in different genic categories using two SNP databases: Celera's CgsSNP, which includes SNPs identified by comparing genomic sequences, and Celera's RefSNP, which includes SNPs from a variety of sources and is biased toward disease-associated genes. Based on CgsSNP, the average numbers of SNPs per 10 kb ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Human molecular genetics
دوره 15 7 شماره
صفحات -
تاریخ انتشار 2006